Phylogenetic and protein prediction analysis reveals the taxonomically diverse distribution of virulence factors in Bacillus cereus strains

Bacillus cereus is a food contaminant with widely varying enterotoxic potential due to its virulence proteins. In this article, phylogenetic analysis of the amino acid sequences from the whole-genomes of 41 strains, evolutionary distance calculation of the amino acid sequences of the virulence genes, and functional and structural predictions of the virulence proteins were performed to reveal the taxonomically diverse distribution of virulence factors. The genome evolution of the strains showed a clustering trend based on the protein-coding virulence genes. The strains of B. cereus have evolved into non-toxic risk and toxic risk clusters with medium-high- and medium-low-risk subclusters. The evolutionary transfer distances of incomplete virulence genes relative to housekeeping genes were greater than those of complete virulence genes, and the distance values of HblACD were higher than those of nheABC and CytK among the complete virulence genes. Cytoplasmic localization was impossible for all the virulence proteins, and NheB, NheC, Hbl-B, and Hbl-L1 were predicted to be extracellular. Nhe and Hbl proteins except CytK had similar spatial structures. The predicted structures of Nhe and Hbl mainly showed ‘head’ and ‘tail’ domains. The ‘head’ of NheA and Hbl-B, including two α-helices separated by β-tongue strands, might play a special role in the formation of Nhe trimers and Hbl trimers, respectively. The ‘cap’ of CytK, which includes two ‘latches’ with many β-sheets, formed a β-barrel structure with pores, and a ‘rim’ balanced the structure. The evolution of B. cereus strains showed a clustering tendency based on the protein-coding virulence genes, and the complete virulence-gene operon combination had higher relative genetic stability. The beta-tongue or latch associated with β-sheet folding might play an important role in the binding of virulence structures and pore-forming toxins in B. cereus.


Introduction
Bacillus cereus (B. cereus), which is one of twelve closely related species in the Bacillus cereus group [1], is a Gram-positive bacterium occurring ubiquitously in nature with widely varying pathogenic potential [2]. Cells are rod-shaped, with some in chains or occasionally long a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 filaments, and are aerobic or facultative anaerobic. Most species will grow on common media such as nutrient agar and blood agar. They are characteristically large (2-7 mm in diameter) and vary in shape from circular to irregular, with matt or granular textures, while smooth and moist colonies are also common. B. cereus, the spores of which can survive at high temperatures and germinated vegetative cells of which can multiply and produce toxins under favorable conditions, is recognized as the most frequent cause of food-borne disease [3]. Its toxins cause two distinct forms of food poisoning, the emetic type (uncommon) and the diarrheal type (common). Diarrheal strains produce three enterotoxins, which belong to the family of pore-forming toxins: nonhemolytic enterotoxin (Nhe), hemolysin BL (Hbl), and cytotoxin K (CytK). Nhe comprises three proteins, NheA, NheB, and NheC, encoded by one operon containing one of three genes, namely, nheA, nheB, and nheC, respectively. Hbl consists of a single B-component (encoded by hblA) and two L-components, L 1 (hblC) and L 2 (hblD), all of which are essential for activity, with no individual or pairwise activity [4]. CytK (cytK) is a singlecomponent toxin [5]. The genes that encode NheABC can be detected in nearly all enteropathogenic B. cereus strains, hblACD can be detected in approximately 45% to 65% of such strains, and cytK is less prevalent [6,7]. B. cereus species, which were compared on the basis of 16S rRNA (identity values >98%), were closely homologous to each other [8]. Nevertheless, the suitability of this marker for the classification of B. cereus might be limited, as it is unable to effectively distinguish between the closely related species [9]. Some papers have reported that the species affiliation of B. cereus group, which could lead to an exchange of virulence plasmids between species, often does not match patterns of phylogenetic relatedness [10,11]. While the enterotoxins of B. cereus are chromosome-coded, the unique characteristics are observed for plasmids and are thus present throughout the B. cereus group [12]. Lapidus et al. reported a large plasmid with an operon encoding all three Nhe components in a B. cereus strain [13]. There is evidence that extensive gene exchange occurs between plasmids and the chromosome during the evolution of the B. cereus group [14]. Therefore, some genes encoded on plasmids can spread via horizontal gene transfer among B. cereus and the transfer of a single plasmid from one species to another [15]. Didelot et al. detected three phylogenetic groups (clades) in a study on the evolution of pathogenicity in the B. cereus group [16]. B. cereus, as a genomospecies, could be mainly found in clade two based MLSA and multiple comparative analysis of ANI values [17]. Later, seven major phylogenetic groups with ecological differences were identified in the B. cereus group [10]. A recent study suggested that nine phylogenetic clades of isolates may be better for assessing the risk of diarrheal foodborne disease caused by B. cereus group isolates [18]. These studies of virulence factors of B. cereus concern the evolutionary classification of virulence genes, and there have been few comparative analyses of the relative evolutionary distance of virulence genes and the prediction of virulence protein function and structure. In this study, the genome, virulence gene sequences and predicted virulence proteins of 41 B. cereus strains were comparatively analyzed. This work aims to examine the species diversity of B. cereus strains and the phylogenetic relationships among virulence factors, to systematically evaluate the distribution of virulence genes, and to comparatively analyze the structures and functions of virulence proteins.

Characterization of B. cereus strains
Forty-one strains of B. cereus with complete-and chromosome-level assemblies in the National Center for Biotechnology Information (NCBI) database were selected for comparative analysis. Datasets that passed the completeness test (acceptable level is >85%) and contamination test (acceptable level is <5%) were composed of sequences submitted on deadline March 5, 2021.
Thirty-one pathogenic and ten nonpathogenic strains that were isolated from food, patients, the environment, and unknown sources, were eligible, along with a control strain, Sporolactobacillus terrae 70-3 (S. terrae 70-3), that belonged to a different genus. In terms of evolution, S. terrae which has a defined taxonomic and phylogenetic status, is closely related to B cereus in Bacillales. The sequences and annotation information for the stains were downloaded from the NCBI (details in Table 1).

Quality assessment of genomic sequences
The contamination and completeness of the metagenomic sequences were evaluated by CheckM software version v1.1.3 [19].

Phylogenetic and average nucleotide identity (ANI) analysis
The first phylogenetic tree, based on whole-genome amino acid sequences of each strain, was constructed by using the CVTree4 webserver (http://cvtree.online/v4/prok/index.html), which constructs whole-genome-based phylogenetic trees without sequence alignment by using a composition vector (CV) approach, and the K-tuple length was 6 [20]. Every genome sequence was represented by a composition vector, which was calculated as the difference between the frequencies of k-strings and the prediction frequencies by the Markov model [21]. The shape and text content of the phylogenetic tree were modified by Molecular Evolutionary Genetics  [22]. In this study, the same genome sequence data were subjected to ANI analysis to verify the significance of the first phylogenetic tree. ANI analysis was performed using JSpeciesWS Online Service (http://jspecies.ribohost.com/jspeciesws/) as described by Richter et al. [23]. The distance matrix, which was calculated by the distance value (DV) using the formula DV = 1-[ANIb value], was used to construct the second phylogenetic tree, which was generated from the resulting Newick format file using Njplot [24]. The formula was balanced using the mean value method and was subjected to calculation using DrawGram in the PHYLIP package version 3.695 [25]. To determine the associations between each protein-coding gene and the different clusters, statistical enrichment analyses were conducted with PhyloGLM V2.6 [17].

Multilocus sequence analysis (MLSA)
A total of forty-one strains containing gene sequences, which were downloaded from the NCBI, were found and further analyzed for the presence of seven housekeeping and three enterotoxin genes. The housekeeping genes adenylate kinase (adk), catabolite control protein A (ccpA), glycerol uptake facilitator protein (glpF), glycerol-3-phosphate transporter (glpT), pantoate-beta-alanine ligase (panC), phosphate acetyltransferase (pta), and pyruvate carboxylase (pyc) were chosen to calculate the basic evolutionary distances of the species. These housekeeping genes, scattered across the entire chromosome, are suitable for MLSA [26]. The types of enterotoxin genes (nhe, hbl, and cytK) were divided into different groups, and the base sequences were concatenated for further MLSA. Thus, rearrangement of genes was unnecessary because the order of the genes within the operons was conserved in all strains. The distances of concatenated genes were calculated in MEGA X using the maximum likelihood (ML) algorithms, which are based on the Tamura-Nei model with a discrete gamma distribution [22]. The model applied for MLSA of DVs was the ideal substitution model according to the 'find best DNA/Protein models' function [27]. The housekeeping genes were of the same length in all strains, as were the different virulence genes. The same settings for the calculation of all phylogenetic DVs were used to ensure comparability of the results. We calculated the relative changes in genetic DVs between the virulence genes, which were concatenated housekeeping genes minus the simple housekeeping gene, representing the change in virulence gene transfer.
varied from 5.20 to 6.47 MB. The G+C contents of the forty-one genomes ranged from 34.70% to 35.75%. Compared with the control genome from S. terrae 70-3 (3.31 MB and 45.30%), the genomes of B. cereus were much larger and had lower G+C contents. The contamination and completeness of the sequences were 0-2.02% and 89.81%-98.99%, respectively (shown in Table 2). These results suggested that these sequences are of high quality, have low contamination (values <2.02%, acceptable level is <5%), and have high completeness (values >89.81%, acceptable level is >85%); thus, they were appropriate for analysis. In this study, the strains originated from food (5/41), the clinic (12/41), the environment (17/41), and undetermined sources (7/41). The enterotoxic risk potential based on the virulence genes of forty-one B. cereus strains is listed in Table 2. Enterotoxicity, which was reflected by virulence gene numbers, was categorized into levels of three types (10/41), two types (14/41), one type ( Table 2).

Phylogenetic analysis based on whole amino acid sequences
Two whole-genome-based methods were used to construct phylogenetic trees. The first phylogenetic tree was constructed with the CV method using the whole amino acid sequences of forty-one B. cereus strains and the outgroup species S. terrae 70-3 [35]. To ensure the accuracy of the results, we added the inbuilt sequence AH1273 and the sequence of S. terrae 70-3 (control) from the webserver database (Fig 1). According to enterotoxic risk potential, the fortyone strains of B. cereus had evolved into five distinct clusters, which were likely risk regions I, IV and V and nonrisk regions II and III. However, there were individual nonconformities, such as nontoxicity of BDRD ST196 in region V. Region I was dominated by medium-and high-risk strains (15/17) but also included two low-risk strains (BHU1 and CO1-1), region II and III included only nonrisk strains (9/9), and regions IV and V were dominated by mediumand low-risk strains (13/15) but also included AH820 (high-risk strain) and BDRD ST196 (nonrisk strain), respectively. To verify the above results and obtain more accurate molecular evolutionary relationships, we established a second phylogenetic tree based on ANI analysis (Fig 2). According to enterotoxic risk potential, the forty-one strains of B. cereus had evolved into six distinct regions: likely risk regions A, C, E, and D 2 and nonrisk regions B and D 1 . The two phylogenetic trees were similar in terms of the regions where enterotoxic risk was likely. The only difference between the two phylogenetic trees was a change in the evolutionary cluster of two strains. The ATCC 10987 and ATCC 4342 strains, which belonged region IV in the first tree, were assigned to region D in the second tree. Region A, which was dominated by medium-high-risk strains (15/17) but also included two low-risk strains (BHU1 and CO1-1), was the same as region I. Regions B and D 1 , which included only nonrisk strains (9/9), were the same as regions II and III. Regions C, D 2 and E, which were also dominated by medium-and low-risk strains (13/15) but included AH820 (high-risk strain in C) and BDRD ST196 (nonrisk strain in E), were the same as regions IV and V. By taking advantage of the updated enterotoxic risk regions found in the current B. cereus strains, we decided to use a statistical approach to evaluate whether the occurrence of virulence factor-encoding genes (detailed in Table 3) correlate with a particular region. We observed that nheABC was significantly present in region C (p < 0.05), and hblA and nheC were significantly present in region A and C (p < 0.05), respectively. The results showed that nheABC and nheC were significantly enriched in the medium-low-risk region, and hblA was significantly enriched in the medium-high-risk region.

Phylogenetic distance analysis based on concatenated housekeeping and virulence genes
To analyze the evolution and phylogenetic relationships of virulence gene transfer in relation to DVs, the nheABC, hblACD, and cytK genes of the forty-one strains, which need to be compared to the housekeeping genes of the strains, were studied. To this end, we concatenated the sequences of virulence proteins from the strains and seven housekeeping proteins (Adk-CcpA-GlpF-GlpT-PanC-Pta-Pyc) from the B. cereus core genome. The genetic DV of virulence gene transfer was evaluated by calculating the average difference in the phylogenetic DV of the ATCC14579 strain compared with forty other strains. The hblD and hblAD virulence proteins, which were observed in only two strains, were excluded. As shown in Table 2, the relative genetic DVs were calculated for nheAB (10/41), hblCD (6/41), nheABC (29/41), hblACD (19/41), and CytK (17/41). As shown in Fig 3, the average evolutionary DVs of virulence gene transfer from high to low were 0.015 (nheAB), 0.012 (hblCD), 0.005 (hblACD), 0.003 (nheABC) and 0.001 (cytK). The DVs of incomplete virulence genes (nheAB and hblCD) were higher than those of complete virulence genes (nheABC, hblACD, and CytK). The average evolutionary DV of nheAB was higher than the DV of hblCD among the incomplete virulence genes; the DV of hblACD was the highest and that of cytK was the lowest among the complete virulence genes.

Comparative prediction analysis of the function and structure of virulence proteins
As shown in Table 4 and Fig 4, we obtained the scores of the seven virulence proteins for subcellular localization prediction. The scores of NheB, NheC, Hbl-B, and Hbl-L 1 were all 9.73, and that of CytK was 9.98, all consistent with extracellular localization. The localization of NheA and Hbl-L 2 was unknown because the scores were all lower in the cytoplasmic membrane (3.33/4.6), cell wall (3.33/2.48), and extracellular space (3.33/2.92), making it impossible for the virulence proteins to appear in the cytoplasm. NheB and Hbl-L 1 had two helices, which were transmembrane region sequences 235-257/267-286 and 239-261/268-290, and NheC had only one helix, of which the transmembrane region was 228-250, but the others had none. The virulence protein cleavage sites of all strains were in the sequence 30-32 with 0.93-0.99 likelihood levels, except NheA, for which the site was in the 26-27 sequence (0.81 likelihood level). The amino acid sequences of Nhe, Hbl and CytK contained N-terminal signal peptides for secretion (< 31 amino acids). The signal peptide start-end was between 1 and 31 sequences but not found for Hbl-B and Hbl-L 1 were not found, and the domain start-end was between 35 and 329 sequences.
Examination of the phylogenetic tree constructed using the Hbl, Nhe, and CytK sequences of ATCC 14579 (Fig 5) showed that NheA and Hbl-L 2 , as well as NheBC and Hbl-L 1 , were more closely related to one another than to the other components, and CytK was the least evolutionarily related. This result was also reflected in the evaluation parameters of the 3-D enterotoxin protein structures. As shown in Table 5, the closest template for NheB and NheC was Hbl-L 1 (sequence identity of 40.82% and 36.83%, respectively), and that for Hbl-L 2 was NheA (24.85%). The closest template for CytK was alpha-hemolysis (30.39%), as expected, with considerable amino acid sequence homology to S. aureus leukocidin [36]. The templates

PLOS ONE
Virulence factor distribution in Bacillus cereus of NheA, and Hbl-L 1 were included in the SWISS-MODEL server with high sequence identity (97.22% and 99.73%, respectively), and Hbl-B had acceptable sequence identity (71.99%). The sequence coverage and range of all structures were 0.71-0.93 and 33-439, respectively, with GMQE evaluation values (0.54-0.88) above 0.5, which indicated reliable model construction. Each residue is allotted a reliability score between 0 and 1, indicating the expected resemblance to the native structure. Higher numbers represent higher reliability of the residues [37]. In general, a sequence identity of >30% for each template was acceptable based on the SWISS-MO-DEL server. To verify the above results (especially the sequence identity of Hbl-L 2 , which was 24.85%) and obtain the predicted Nhe-trimer and Hbl-trimer structures, we used AlphaFold software for secondary structural prediction. The results were acceptable, the predicted localdistance difference test (plDDt) values of monomers were 81.90-94.14, and the scores of Nhe trimers and Hbl trimers were 0.68 and 0.36 (ipTM+pTM), respectively [33,34].
Due to the sequence similarity of NheB and NheC with Hbl-B, homology models based on the Hbl-B structure were established. As shown in Fig 6, NheA and Hbl-B had highly similar structures (Fig 6A), and the NheA, NheB, NheC, HBl-B, HBl-L 1 , and HBl-L 2 structures showed that there were two main domains, a 'head' and 'tail' (Figs 6B-6D, 6F-6H, and 7A-7F). The main body of the structure was formed by the 'tail' domain, which consisted of five major helices, and the 'head' domain of NheA included two long α-helices separated by β-tongue strands (Figs 6B and 7A). Multiple β-tongue strands were detected in Hbl-B (Fig 7D) but are not shown in Fig 6F because of the prediction method. Another difference was the 'head' of Hbl-L 2, possibly related to the low sequence identity (Figs 6H and 7F). The 'latch' with many β-sheets of CytK folded the 'cap' domain, which was the toxic area (Figs 6E and 7G). The amino 'latch', which included a short helix in all known pore structures, was observed on the top of the conformation, which extended into the pore to form a β-barrel and was folded into a stranded antiparallel β-sheet in the monomer. Although the amino 'latch' protrudes and interacts with the adjacent protomer in the pore, it is located at the edge of the β-sheet of the 'cap' region [39]. The 'rim' domain, which was composed of three strands of short β-sheets, formed the main body of the balanced structure. The trimers of Nhe and Hbl were horizontally arranged. The Hbl trimer (arranged in the sequence B, -L 1 , -L 2 ) was more similar than the Nhe trimer (arranged in the sequence A, B, C) based on the structural features. The β-tongue strands of Hbl-B and NheA might play an important structural and functional role in the formation of trimers.

Discussion
In this study, forty-one strains of B. cereus were subjected to phylogenetic analyses based on whole amino acid sequences. Enterotoxicity, which was evaluated on the basis of nheABC, hblACD, and cytK gene expression, was classified into levels of three types, two types, one type, and no types. In terms of evolutionary relationships, clusters of virulence and nonvirulence gene strains were evident, and the regional distribution of the number of types of virulence genes was also presented, further confirmed by ANI-based phylogenetic analyses. We found that the two phylogenetic trees were similar. All non-toxic-risk strains were concentrated in two clusters, and all but two of the medium-high-and medium-low-toxic-risk strains formed clusters. The results suggest the possibility of virulence gene transfer, which may be related to frequent exchange of pathogenicity factors during B. cereus virulence evolution, including socalled probiotic or nonpathogenic species [15]. Previous taxonomic results for the B. cereus  Table 2.
https://doi.org/10.1371/journal.pone.0262974.g002 Table 3 group are largely based on inadequate criteria such as virulence characteristics, which residing on virulence plasmids [9]. Due to rampant horizontal gene transfer in bacterial ecosystems, increasing numbers of "core" genes should be found and defined based on refined species classification [41]. Recently, phylogeny-aware methods based on linear regression models were applied at the whole-genome scale to study the genomes of bacteria [17]. By using this statistical approach, we hereby observed that the virulence genes nheABC and nheC positively correlated with enrichment in the medium-low-risk cluster, and hblA was found in the mediumhigh-risk cluster. The inconsistent evolutionary distribution of individual virulence genes may be due to other factors, which needs further study.
The Bacillus hemolytic and nonhemolytic enterotoxin family of proteins consists of several Bacillus enterotoxins, which can cause food poisoning in humans [42]. Hemolytic BL and   [43]. Our MLSA results showed that in the process of toxin molecular evolution, there were differences between the results for complete and incomplete virulence proteins, and two toxic-type genes had a more significant effect in relation to DVs than three toxic-type genes. The results suggested that the complete virulence-gene operon combination has higher relative genetic

PLOS ONE
stability. The DV of hemolysin Bl was greater than that of nonhemolytic cytotoxin K. nheABC, which was responsible for most of the cytotoxic activity of B. cereus isolates, showed stable, strictly vertical inheritance [44]. In contrast to hbl, duplication or deletion of nhe, which was The global model quality estimate (GMQE) is a quality estimate that combines properties from the target-template alignment and the template structure [38]. plDDt: Predicted local-distance difference test [29]; pTM: Predicted TM score; ipTM: Interface pTM; ipTM+pTM: Evaluation result of multimer prediction [34].
https://doi.org/10.1371/journal.pone.0262974.t005 almost exclusively transmitted vertically, was rarely observed, and cytK, a one-type gene, had the highest relative genetic stability [15]. Bazinet revealed significant associations of particular genes with phenotypic traits shared by groups of taxa [41]. Currently, it is commonly accepted that the toxicity potential of B.

PLOS ONE
cereus is not driven by enterotoxin gene types because the expression of enterotoxin genes is highly complex and probably strain-specifically affected by transcription, posttranscriptional and posttranslational modification [45][46][47]. Carroll et al. suggested that further classification and descriptions of phenotypes should be added on the basis of genotype classification in B. cereus [2]. In this study, both Nhe and Hbl are three-component cytotoxins composed of binding components A and B and two lytic components B, C and -L 1 , -L 2 , with all three subunits acting synergically to cause illness. The amino acid sequences of all Nhe, Hbl and Cytk components containing N-terminal signal peptides indicated toxin secretion via the secretory translocation pathway. The final positions of Hbl-B, Hbl-L 1 , NheB, NheC, and CytK were all extracellular and did not appear in the cytoplasm, and the two transmembrane regions of NheB and Hbl-L 1 might be responsible for transporting the assembled three-component cytotoxins across the membrane to complete the toxic effect. Dietrich et al. found that the factor triggering enterotoxin production under simulated intestinal conditions by various cell lines from different organisms and compartments was independent of cell differentiation [48]. Similarities were found when predicted transmembrane helices were compared. NheA and Hbl-B had no such helices, and NheB and Hbl-L 1 had two that may play an important role in molecular docking and transmembrane activities. Furthermore, the Nhe components seem to be additionally processed in the extracellular space after separation from the signal peptide for secretion [48]. The difference is that NheC had one such component and Hbl B had none, which may strengthen the secretion of the Nhe protein.
The Nhe and Hbl proteins share sequence similarities, both between the three components of each complex and between the two enterotoxin complexes [12]. The structural and functional properties were consistent with those of the superfamily of pore-forming cytotoxins of Hbl and Nhe [49][50][51]. The NheA, NheB, NheC, HBl-B, HBl-L 1 , and HBl-L 2 structures showed two main domains, a 'head' and 'tail'. The 'heads' of NheA and Hbl-B, including two α-helices separated by β-tongue strands, play a special role in Nhe trimers and Hbl trimers, respectively. Upon contact with lipids, cell membranes or detergents, the protein oligomerizes and forms ring-shaped structures acting as transmembrane pores [52,53], and the hydrophobic β-tongue is assumed to be inserted into the membrane first [54]. It is worth noting that NheB, NheC, Hbl-L 1 , and Hbl-L 2 had few or no β-strands in the 'head', which were either responsible for conformational changes of NheA and Hbl-B or for the stabilization of the 'head' domain [50] or might lead to reduced toxicity or only ligand function [41]. The difference was reflected in the triplet prediction results, i.e., a significant difference in structural arrangement compactness between Nhe trimers (noncompact type) and Hbl trimers (compact type). Ganash et al. speculated that the Nhe trimer requires interaction with unknown proteins of an additional function [41]. A specific binding order of the three Nhe and Hbl components is also necessary for pore formation [55,56]. We found that NheB and Hbl-L 1 were close to NheA and Hbl-B in the predicted trimer structure. Didier et al. found that NheA is important for attaching to cellbound NheB and NheC and that NheB is the main interaction partner of NheA [57], and a further correlation was found for the amounts of Hbl B and Hbl L 1 [46]. Cytotoxin K is a single protein with β-barrel pore-forming toxin in contrast to the tripartite toxin complexes Hbl and Nhe. The CytK structure, which exhibits two 'latches' with many β-sheets folded beside the 'cap' domain forming a β-barrel, was the pore structure on top of the conformation. The 'rim' region, which was folded into a three-stranded antiparallel β-sheet, balanced the structure in the monomer. The predicted structure revealed that CytK was likely to belong to the leukocyte toxin family. These monomers diffuse to target cells and are attached to them by specific receivers [58], which are lipids and proteins that cause lysis of red blood cells by destroying their cell membrane [59].

Conclusion
In this study, we describe the molecular evolution, function and structural diversity of virulence factors in B. cereus strains. The evolution of B. cereus strains showed a clustering trend based on the coding virulence genes. The complete virulence gene operon combination had higher relative genetic stability than the incomplete operon. The two α-helices in the 'head' of the NheA and Hbl-B structures, which are separated by β-tongue strands, and two 'latches' with many β-sheets folded beside the 'cap' of the CytK structure might play a special role in the binding of virulence structures and pore-forming toxins in B. cereus. Overall, the exact mechanism by which B. cereus causes diarrhea remains unknown, but our results provide helpful information for better understanding the taxonomically diverse distribution of virulence factors in B. cereus strains.

Formal analysis: Jun Liu.
Funding acquisition: Ming Zhang.